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(57) Abstract 

The present invention 
relates to a keratinocyte growth 
factor fragment, KGFdai-23, that 
is composed of a portion of an 
amino acid sequence of mature, full 
length keratinocyte growth factor, 
KGF163- The portion possesses at 
least a 2-fold increase in autogenic 
activity as compared to a mature, 
recombinant keratinocyte growth 
factor, rKGF, but lacks a sequence 
comprising the first 23 amino acid 
residues, C-N-D-M-T-P-E-Q-M- 
A-T-N-V-N-C-S-S-P-E-R-H-T-R- 
of the KGF 163 N-terminus. The 
present invention also relates 
to a DNA molecule encoding 
KGF&si-z), an expression vector 
and a transformed host containing 
the DNA molecule, and a method of 
producing KGF^i.23 by culturing 
the transformed host The present 
invention further relates to a 
conjugate of KGF<ksi-23 and a 
toxin molecule, and the use thereof 
for treatment of byperproliferative 
disease of the epidermis. Moreover, 
the present invention relates to a 
therapeutic composition containing 
KGFdai-23 and a pharmaceutically acceptable carrier and 
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the use thereof for wound healing purposes. 
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A TRUNCATED KERATINOCYTE GROWTH FACTOR (KGF) 
HAVING INCREASED BIOLOGICAL ACTIVITY 

5 FIELD OF T HE INVENTION 

The present invention relates generally to keratinocyte growth factor ("KCF"). 
More specifically, this invention relates to a truncated KGF fragment and analogs thereof 
having increased biological activity and decreased cytotoxicity as compared to a 
10 recombinant, full length, KGF expressed in an insect cell system. This KGF fragment, 
designated herein as KGF^j.^ lacks the first 23 amino acid residues of the N-terminus of 
mature full-length KGF, this N-terminus that includes a glycosylation site at amino acid 
residues 14 to 16 from the N-terminus as indicated in Finch, P.W. et al Science 245:752- 
755 (1989) and was previously believed to confer upon KGF its epithelial cell specificity. 

15 

BACKGROUND OF THE INVENTION 

KGF belongs to a family of fibroblast growth factors C , FGFs w ), the prototype of 
which is represented by basic FGF ("hFCF"). KGF is, hence, also known as FGF-6. 

20 Like other FGFs, KGF is a heparin binding protein, but unlike other FGFs, it has a 

unique target cell specificity. In particular, FGFs are generally capable of stimulating the 
proliferation and differentiation of a variety of cell types derived from the primary or 
secondary mesoderm as well as from neuroectoderm. KGF is similar to other FGFs in its 
ability to stimulate epithelial cell proliferation, but is dissimilar to other FGFs in its 

25 inability to stimulate endothelial cells or fibroblast proliferation, as discussed in Finch, 
P.W. et al.(loc. cit.) 

FGFs, including acidic fibroblast growth factor ("aFGF") and basic fibroblast 
growth factor ("bFGF"), are known to have heparin-binding properties and have the 
ability to induce the differentiation and proliferation of ventral, as well as dorsal 

30 mesoderm in early blastulae, as discussed in Gospodarowicz et aL, Cell BioL Rev. 
25:307-314 (1991), and Basilico et al^Adv. Cancer Res. 59: 115-165 (1992). The 
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response of cells to FGF is mediated through binding thereof to cell surface receptors 
("FGFRs"), of which there are three inter-related types, as discussed in Hou et al. t 
Science 257:665-668 (1991). High affinity FGFRs are tyrosine kinases and include the fig 
receptor ("FGFR-1"), the bek receptor ("FGFR-2"), and the K Sam receptor ("FGFR-3"), 
5 as discussed in Lee et al f Science 245:57-60 (1989); Dionne et al, EMBO £2685-2692 
(1990); Miki et al., Science 257:72-75 (1991); Miki et al, Proc. Natl Acad. Set USA 
«£246-250 (1992); and Dell et al., J. Biol Chem. 267:21225-21229 (1992). 

Both FGFR-1 and FGFR-2 are widely expressed in mesodermal and 
neuroendodermal tissues, and both are able to bind aFGF and bFGF with similar 

10 affinities. FGFR-3 is a KGF receptor that is specific to epithelial cells. It is an 

alternative transcript of FGFR-2. In contrast to FGFR-2, which shows high affinity for 
both aFGF and bFGF and no affinity for KGF, FGFR-3 binds KGF and aFGF with an 
affinity approximately 20 to 1000 fold higher than bFGF, as discussed in Miki et al. (loc. 
cif.), and Dell et al. (loc. at.) 

15 The tightly restricted activity profile of KGF to epithelial cells is desirable, for 

example, as in many types of wound healing applications as well as in the treatment of 
hyperproliferative diseases of the epidermis such as psoriasis and basal cell carcinoma. 
Presently, except for KGF, no highly suitable mitogenic factor exists for these 
applications. It would be desirable, therefore, if KGF could be modified to increase its 

20 potency and decrease its cytotoxicity, for therapeutic applications. 

Recently, Ron et al., J. Biol Chem. 268: 2984-2988 (February 1993) found that 
when KGF 1U was expressed in a T7 prokaryotic expression system, a recombinant KGF 
("rKGF") polypeptide could be obtained that possessed mitogenic activity. When the 
rKGF molecule was truncated by deletion of 3, 8, 27, 38, and 48 amino acid residues, 

25 respectively, from the N-terminus of the mature KGF I63 polypeptide, biological activity of 
the resulting molecules varied. With deletion of 3 and 8 amino acid residues, 
respectively, the mitogenic activity of the resulting molecules did not appear to be affected 
as compared to full-length rKGF ("rKGF 163 "). Deletion of 27 amino acid residues, 
however, resulted in 10-20 fold reduced mitogenic activity. Deletion of 38 and 48 amino 

30 acid residues, respectively, resulted in complete loss of mitogenic activity and heparin 
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binding ability. Thus, Ron et al. failed to produce any truncated KGF fragments that 
possessed increased mitogenic activity as compared to the rKGF l63 molecule. 



SUMMARY OF THE INVEN TI ON 

5 

One of the objects of the present invention is to provide a KGF fragment that 
contains a portion of an amino acid sequence of mature, full-length KGF, KGF 163 , and 
possesses at least a 2-fold increase in mitogenic activity as compared to the full-length 
recombinant KGF, rKGF 163 A particular objective is to provide a KGF fragment that lacks 
10 a sequence comprising the first 23 N-terminus amino acid residues, C-N-D-M-T-P-E-Q- 
M-A-T-N^-N-C-S-S-P-E-R-H-T-R-, of rKGF l63 . 

Another one of the objects of the present invention is to provide a KGF fragment 
that has decreased cytotoxicity as compared to rKGF l63 . Still another object is to provide 
a conjugate that comprises the KGF fragment described above and a toxin molecule. The 
15 toxin molecule can be at least one of ricin A, diphtheria toxin, and saporin. 

Yet another one of the objects of the present invention is to provide a therapeutic 
composition that contains the KGF fragment, as described above, and a pharmaceutical^ 
acceptable carrier, for example, one suitable for topical application to human skin. 

Still another one of the objects of the present invention is to provide a DNA 
20 molecule that is composed of a nucleotide sequence that encodes the KGF fragment 
described above. 

Yet another one of the objects of the present invention is to provide an expression 
vector that contains the DNA molecule that encodes the KGF fragment, described above, 
and a regulatory sequence for expression of the DNA molecule. The expression vector 
25 can be, for example, a baculovirus. 

Yet another one of the objects of the present invention is to provide a host cell 
transformed with the expression vector described above. The host cell can be, for 
example, a bacterial cell, a yeast cell, a mammalian cell, or an insect cell. 

Yet another one of the objects of the present invention is to provide a method of 
30 producing the KGF fragment by culturing the transformed host cell, as described above, 
and isolating the KGF fragment from the culture. 
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SOU anoth r one of the objects f the present invention is to provide a method of 
simulating epithelial cell growth by applying the KGF fragment to an area in which 
epithelial cell growth is desired and allowing the cell to grow. 

StiU another one of the objects of the present invention is to provide a method for 
wound healing by applying the therapeutic composition described above to an area of a 
wound to be treated and allowing the wound to heal. 

Still another one of the objects of the present invention is to provide a method of 
treating a hyperproliferauve disease of the epidermis by applying the conjugate described 
above to an area to be treated. 

Further objects, features, and advantages of the present invention would be 
apparent to a person of ordinary skill in the an and need not be enumerated here T*e 
present invention includes variants and modifications of the above that are apparent to a 
person of ordinary skill in the art and that do not substantially alter the nature and activity 
of the fragment, conjugate, therapeutic composition, vector, host, and methods of use. 

SUMMARY n F T HE T)R A Wiwnc 

FIG. 1 shows the amino acid sequence of the mature, full-length human KGF 
polypeptide, beginning with the N-terminus and containing 163 amino acid residues and a 
single N-linked glycosylation signal at amino acid residues 14-16. 

FIG. 2 shows the P AcC13 expression vector into which the 163 amino acid 
sequence of KGF has been inserted. 

FIG. 3 illustrates the further purification of the KGF fragment of the present 
invention from the SF9 cells conditioned medium after elution in a Heparin Sepharose 
(-HS-) column with 1 M NaCl. lUe bioactive fractions from the HS column were 
pooled, diluted 5-fold with 10 mM Tris, pH7.3, and loaded onto a Mono S HR5/5 cation 
exchange FPLC. 

FIG. 4 compares the biological activity of the long, i.e., rKGF, 63 , and the short, 
i.e., KGF^ljj, form of KGF versus aFGF on cells of the Balb/Mk cell line. 



WO 95/01434 



PCT/US94/04694 



5- 



FIG. 5 compares the biological activity of the long, i.e., rKGF l63 , and the short, ' 
i.e., KGT^, form of KGF versus bFGF on vascular endothelial cells derived from 
large vessels (A:ABAE cells) or capillary (B:ACE cells). 

PETAfLED DESCRIPTION OF TWIT, P REFERRFX) FMBOprMTTNT g 

It has been surprisingly discovered that a truncated, unglycosylated KGF, referred 
to herein as the KGF fragment or KGF^,^, having a deletion spanning the first 23 N- 
terminus amino acid residues of rKGF I43 , possesses a greater biological activity and 
decreased cytotoxicity on epithelial cells as compared with the rKGF, 5J . Generally, the 
KGF fragment of the present invention retains the specificity of KGF 163 for stimulation of 
epithelial cell proliferation. 

Accordingly, a preferred embodiment of the present invention is a novel 
truncated, unglycosylated KGF fragment, KGF tol . B , unaccompanied by impurities that 
normally accompany the native molecule when it is produced in vivo. This fragment has 
an apparent molecular weight of about 18 kDa based upon its migration in Na Dod 
SO,/PAGE as a single peak. The specific activity of purified KQF M on Balb/Mk cells 
is approximately 2.5 x 10 7 units per milligram (ED*, 40 pg/ml), about 7- to 10-fold 
greater than that of the rKGF I63 protein, or aFGF when compared in a cell proliferation 
assay, and 100-fold greater than when rKGF I6J is bioassayed by examining initiation of 
DNA synthesis in Balb/Mk cells in a chemically defined medium, as described in PCT 
patent application no. WO 90/08771. 

In another preferred embodiment of the present invention, KGF,^ is produced 
by recombinant DNA technology, particularly for large-scale commercial production. In 
a further preferred embodiment, recombinant DNA molecule and an expression vector 
each containing KGF^ or analogs thereof in accordance with the present invention is 
carried out by standard gene expression technology using methods well known in the art. 

In a further embodiment, the DNA or vector comprising the DNA encoding 
KGF^.jj can be expressed in a bacterial, mammalian, yeast, or insect cell expression 
system. In a preferred embodiment, a bacterial or yeast cell expression system is ideal for 
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production of the KGF,,^ fragment. In another preferred embodiment, the DNA or 
vector is expressed in an insect cell expression system. 

The KGF fragment of the present invention can be used for identification of 
receptor recognition sites as weU as for the design of peptide agonists or antagonists. 
Moreover, in view of the unique specificity of KGF for keratinocytes, its inability to 
induce the proliferation of vascular endothelial cells or fibroblasts, its and lack of 
cytotoxicity, in another embodiment of the present invention, KGF^ is preferably used 
for wound healing applications, particularly where there is a desire to promote 
re-epithelialization of the skin. A preferred embodiment, KGF^,.* is used in corneal 
epithelial repair. Other applications of KGF taI . B can be envisaged based upon its 
specificity for epithelial cells found in the gastrointestinal tract. 

The selection of the KGF fragment herein over other growth factors such as 
epidermal growth factor ("EGF"), platelet-derived growth factor ("PDGF"), and other 
FGFs for skin repair is within the skill of a person in the art. The other growth factors, 
for example, induce fibroplasia and angiogenesis, in addition to stimulating either directly 
or indirectly keratinocyte proliferation. In skin repair, such additional activities could 
produce undesirable side effects such as scaring. In corneal repair involving either a 
wound or surgery, the use of these other factors could induce blood vessel invasion into 
the cornea, resulting in undesirable cornea opacity or edema. KGF, on the other hand, 
has a unique specificity for keratinocytes and does not induce the proliferation of vascular 
endothelial cells or fibroblasts and, therefore, would be the agent of choice for these 
particular wound healing applications. 

Further, the KGF fragment of the present invention can be used in another 
embodiment of the present invention, in the form of a KGF fragment/toxin conjugate. 
Such a conjugate can slow down or stop the disease process in diseases such as psoriasis 
that results from hyperproliferation of the basal layer of keratinocytes from the epidermis. 
This conjugate would not affect the other two major celltypes such as vascular endothelial 
cells and fibroblasts, present in these tissues. 

In another embodiment of the present invention a KGF fragment/toxin conjugate 
may be used to eradicate tumor cell proliferation, for example, basal cell carcinoma. 
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The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of molecular biology, microbiology, recombinant DNA, and 
immunology, that are within the skill of the art. Such techniques are explained fully in 
the literature. See e.g., Sambrook, et al., MOLECULAR CLONING: A LABORATORY 
5 MANUAL, 2nd ed. Cold Spring Harbor Laboratory Press (1989); DNA CLONING, 
VOLUMES I AND II, D.N Glover ed. IRL Press (1985); OLIGONUCLEOTIDE 
SYNTHESIS M.J: Gait ed., IRL Press (1984); NUCLEIC ACID. HYBRIDIZATION, 
B.D. Haines & SJ. Higgins eds. IRL Press (1984); TRANSCRIPTION AND 
TRANSLATION, B.D. Hames & S.J. Higgins eds. IRL Press (1984); ANIMAL CELL 

10 CULTURE, R.I. Freshney ed., IRL Press (1986); B. Perbal, A PRACTICAL GUIDE TO 
MOLECULAR CLONING, [publisher] (1984); the series, METHODS IN 
ENZYMOLOGY, Academic Press, Inc.; GENE TRANSFER VECTORS FOR 
MAMMALIAN CELLS, J.H. Miller and M.P. Calos eds.. Cold Spring Harbor 
Laboratory (1987); METHODS IN ENZYMOLOGY, Vol. 154 and Vol. 155, Wu and 

15 Grossman, and Wu, eds., respectively, [Mayer and Walker, eds. ????? [publisher] 

(1987), IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY, 
Academic Press, London, Scopes [author??], (1987), PROTEIN PURIFICATION: 
PRINCIPLES AND PRACTICE, 2nd ed. (Springer-Verlag, N.Y. year?), and 
HANDBOOK OF EXPERIMENTAL IMMUNOLOGY, VOLUMES I-IV, D.M. Weir and 

20 C. C. Blackwell eds., (publisher? 1986); Kitts etal., Biotechniques 14:810-817 (1993); 
Munemitsu et al., Mol. and Cell. Biol. 10:5977-5982 (1990). [OTHERS] 

Standard abbreviations for amino acids are used in Fig. 1 and elsewhere in this 
specification. All publications, patents, and patent applications cited herein are 
incorporated by reference. For clarification, the terms used herein are more particularly 

25 defined below. 
Definitions 

As used herein, the term "keratinocyte growth factor" or "KGF" refers to a 
polypeptide that belongs to a family of structurally distinct proteins, the fibroblast growth 
factors, FGFs, that display varying degrees of sequence homology suggesting that they are 
30 encoded by a related family of genes. KGF has the characteristics described elsewhere in 
this specification, for example, it binds to FGFR-3 and is capable of stimulating growth of 
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epithelial cells, especially keratinocytes. Full-length KGF is comprised of 163 amino acid 
residues. 

As used herein, the KGF fragment of the present invention is a polypeptide that is 
composed of a portion of the amino acid sequence of the mature, full-length KGF. An 
5 analog of the KGF fragment, as used herein, includes minor insertions, deletions or 
substitutions of the KGF fragment that do not substantially affect its properties. For 
example, conservative amino acid replacements are contemplated. Such replacements are, 
for example, those that take place within a family of amino acids that are related in their 
side chains. The families of amino acids include (1) acidic: aspartic acid, glutamic acid; 

10 (2) basic: lysine, arginine, histidine; (3) non-polar: alanine, valine, leucine, isoleucine, 
proline, phenylalanine, methionine, tryptophan; and (4) uncharged polar: glycine, 
asparagine, glutamine, cystine, serine, threonine, tyrosine. Phenylalanine, tryptophan, 
and tyrosine are sometimes classified jointly as a family of aromatic amino acids. In 
particular, it is generally accepted that conservative amino acid replacements consisting of 

15 an isolated replacement of a leucine with an isoleucine or valine, or an aspartic acid with 
a glutamic acid, or a threonine with a serine, or a similar conservative replacement of an 
amino acid with a structurally related amino acid, in an area outside of the polypeptide's 
active site will not have a major effect on the properties of the polypeptide. 

The properties of the KGF fragment include (i) its biological activity such as a 

20 2-10 fold, preferably a 2 fold, most preferably a 7-10 fold increase in stimulation of 

epithelial cell proliferation as compared to the mature, full-length native KGF molecule, 
and (ii) its ability to bind to FGFR-3. 

The term "recombinant polynucleotide" as used herein intends a polynucleotide of 
semisynthetic, or synthetic origin, or encoded by cDNA or genomic DNA ("gDNA") such 

25 that (1) it is not associated with all or a portion of a polynucleotide with which it is 

associated in nature, (2) is linked to a polynucleotide other than that to which it is linked 
in nature, or (3) does not occur in nature. 

A "replicon" is any genetic element, such as a plasmid, a chromosome, a virus, a 
cosmid, etc., that behaves as an autonomous unit of polynucleotide replication within a 

30 cell; i.e., it is capable of replication under its own control. A replicon may include, for 
example selectable markers. 
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A "recombinant vector" is a replicon in which another polynucleotide segment is 
attached, so as to bring about the replication and/or expression of the attached 
polynucleotide segment 

A "regulatory sequence" refers to a polynucleotide sequence that is necessary for 
5 regulation of expression of a coding sequence to which the polynucleotide sequence is 
operably linked. The nature of such regulatory sequences differs depending upon the host 
cell in which the coding sequence is to be expressed. In prokaryotes, such regulatory 
sequences generally include, for example, a promoter, a ribosomal binding site, and/or a 
transcription termination sequence. In eukaryotes, generally, such regulatory sequences 
10 include a promoter and/or a transcription termination sequence. The term "regulatory 

sequence" may also include additional components the presence of which is advantageous, 
for example, a secretory leader sequence that encodes for secretion of the polypeptide to 
be expressed. 

"Operably linked" refers to a juxtaposition wherein the components so linked are 
15 in a relationship permitting them to function in their intended manner. For example, a 
regulatory sequence is "operably linked" to a coding sequence when it is joined in such a 
way that expression of the coding sequence is achieved under conditions compatible with 
the regulatory sequence. 

A "coding sequence" is a polynucleotide sequence that is translated into a 
20 polypeptide, when placed under the control of an appropriate regulatory sequence. The 
boundaries of the coding sequence are generally determined by a translation start codon at 
its 5' -terminus and a translation stop codon at its 3'-terminus. A coding sequence can 
include, for example, cDNA, and recombinant polynucleotide sequences. 

"PCR" refers to the technique of polymerase chain reaction as described in Saiki, 
25 et a/., Nature 324:163 (1986); and Scharf et al. 9 Science 233:1076-1078 (1986); U.S. 
Patent No. 4,683,195; and U.S. Patent No. 4,683,202. 

As used herein, the term "polypeptide" refers to a polymer of amino acids and 
does not refer to a specific length of the product. Thus, peptides, oligopeptides, and 
proteins are included in this term. This term also includes translation modifications of the 
30 polypeptide, for example, glycosylations, acetylations, phosphorylations and the like. 

Further included in this definition are, for example, polypeptides with substituted linkages, 
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as well as other modifications known in the art, both naturally occurring and non-naturally 
occurring. 

As used herein, "terminators" are regulatory sequences, such as polyadenylation 
and transcription termination sequences, located 3' or downstream of the stop codon of the 
5 coding sequences. 

"Recombinant host cells," "host cells," "cells," "cell cultures," and other such 
terms denote, for example, microorganisms, insect cells, and mammalian ceils, that can be 
or have been used as recipients for introduction of recombinant vector or other transfer 
DNA, and include the progeny of the cell that has been transformed. Such progenies 
10 include those that may not necessarily be identical in morphology or in genomic or total 
DNA complement as the original parent, and that may be produced as a result of natural, 
accidental, or deliberate mutation. Examples of mammalian host cells include Chinese 
hamster ovary ("CHO") and monkey kidney ("COS") cells. 

As used herein, the term "microorganism" includes prokaryotic and eukaryotic 
15 microbial species such as bacteria and fungi, the latter including yeast and filamentous 
fungi. 

"Transformation," as used herein, refers to the transfer of an exogenous 
polynucleotide into a host cell, irrespective of the method used for the transfer, which can 
be, for example, by infection, direct uptake, transduction, F-mating, microinjection or 

20 electroporation. The exogenous polynucleotide may be maintained as a non-integrated 
vector, for example, a plasmid, or alternatively, may be integrated into the host genome. 

"Purified" and "isolated" in reference to a polypeptide or a nucleotide sequence 
means that the indicated molecule is present in substantial absence of other biological 
macromolecules of the same species or type. The term "purified" as used herein means at 

25 least 75% by weight; preferably, at least 85% by weight, more preferably, at least 95% 
by weight and, most preferably, at least 98% by weight, of biological macromolecules of 
the same type are present, but water, buffers, and other small molecules, especially mole- 
cules having a molecular weight of less than 1000, can be present as well. 

By "pharmaceutical^ acceptable carrier," is meant any carrier that is used by 

30 persons in the art for a administration into a human that does not itself induce any 

undesirable side effects such as the production of antibodies, fever, etc. Suitable carriers 
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are typically large, slowly metabolized macromolecules that can be a protein, a 
polysaccharide, a polylactic acid, a polyglycolic acid, a polymeric amino acid, amino acid 
copolymers or an inactive vims particle. Other carriers are well known to those of 
ordinary skill in the art. Preferably, the carrier contains thyroglobulin. 

A therapeutic composition typically will contain pharmaceutically acceptable 
carriers, such as water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, 
such as wetting or emulsifying agents, pH buffering substances, and the like, may be 
present in such compositions. 

By "A therapeutically effective amount,* as used herein refers to that amount that 
is effective for production of a desired result. This amount varies depending upon the 
health and physical condition of the individual to be treated, the capacity of the 
individual's immune system to synthesize antibodies, the degree of protection desired, the 
formulation, the treating doctor's assessment of the medical situation, and other relevant 
factors. It is expected that the amount will fall in a relatively broad range that can be 
determined through routine trials. 

It has been unexpectedly found that when KGF is expressed in insect cells, 
Spodoptera frugiperda ("SV9")> by infection of a recombinant baculovirus, Aiuographa 
californica, containing the cDNA encoding KGF| M , they produced a KGF fragment that 
lacks the first 23 amino acid residues of the KGF I63 N-terminal domain containing a single 
glycosylation site, in addition to KGF 163 . The truncated and unglycosylated KGF fragment 
KGF^i.23 or KGF140 ,in contrast to the native long form identified as KGF )63 , has a 7- to 
10-fold increased potency on target cells. The target cell specificity of KGF^,.^ is 
unchanged. Additionally, at high concentrations of the KGF fragment, no toxic effect on 
keratinocytes is observed in contrast to that observed for KGF l63 . These observations 
suggest that contrary to what was previously proposed in PCT application no. WO 
90/08771, the target cell specificity of KGF does not reside in its N-terminal domain. 
Furthermore, the present invention shows that an N-terminally truncated version of KGF 
in fact represents an improved KGF version with higher biological activity and decreased 
cytotoxicity for therapeutic application. 

A large amount of the KGF fragment can be produced by recombinant DNA 
techniques which is preferable over the alternative process of isolating and purifying KGF 
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from natural sources and cleaving off the first 23 amino acid residues from the N-terminal 
thereof. KGF produced by recombinant techniques permits the KGF to be isolated in the 
absence of contaminating molecules that are normally present in cells. Indeed, KGF 
compositions entirely free of any trace of human protein contaminants can readily be 
produced in, for example, bacterial cells, yeast cells, and insect cells. By use of 
recombinant DNA techniques, KGF fragments that are not found in nature, such as 
variants that can be produced by site-directed mutagenesis, can also be produced. 

Any promoter that would allow expression of the KGF fragment in the desired 
host can be used in the present invention. For example, in £. coli, the regulatory 
promoter sequence of the lac operon can be used. Another example is the yeast alcohol 
dehydrogenase ("ADH") promoter, which has an upstream activator sequence ("UAS*) 
that modulates the activity of the ADH promoter. Additionally, certain viral enhancers 
can also be used herein. Such enhancers not only amplify but also regulate expression in 
mammalian cells. These enhancers can be incorporated into mammalian promoter 
sequences which will become activated only in the presence of an inducer, such as a 
hormone or an enzyme substrate as discussed in Sassone-Corsi and Borelli Trends Genet. 
2:215 (1986); Maniatis et al. Science 236:1237 (1987). Promoters that may be used in 
the present invention also include the baculovirus polyhedron hybrid promotor and the plO 
promoter. 

Examples of termination sequences that can be used in the present invention are 
the Saccharomyces cerevisiae alpha-factor terminator and the baculovirus terminator. 
Furthermore, viral terminators that are operable in certain host cells can also be used. 
For instance, the SV40 terminator is functional in Chinese Hamster Ovary (CHO) cells. 

When designing truncated proteins, several of the KGFs are preferred, in the 
practice of this invention. One is unglycosylated KGF, for example, truncating the first 
23 amino acids of native KGF. A cDNA encoding a preferred truncated KGF is 
presented in Figure 1. Other species of KGF fragments exist or can be created by routine 
methods. In the sequence shown, the cleavage site for a truncated protein may occur 
where indicated by arrow (a), resulting in a protein having a molecular weight of about 
18,000 Da. Utilizing the sequence data in Figure 1, as well as the denoted characteristics 
of truncated KGF, it is within the skill of the art to obtain other DNA sequences encoding 
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truncated KGF. For example, the structural gene may be manipulated by varying 
individual nucleotides, while retaining the correct amino acid(s), or varying the 
nucleotides, so as to modify the amino acids, without loss of biological activity. 
Nucleotides may be substituted, inserted, or deleted by known techniques, including, for 
5 example, in vitro mutagenesis and primer repair. The structural gene may be truncated at 
its 3*-terminus and/or its S'-terminus while retaining its biological activity. 

S TOPPED HERE 

10 The KGF fragment coding sequence can be constructed by synthesizing the DNA 

sequence encoding the KGF fragment or by altering an existing or native KGF coding 
sequence to produce the desired sequence. The native KGF coding sequence or 
polypeptide are those that occur in nature. The amino acid sequence of the native KGF is 
shown in [FIGURE X]. The synthetic KGF fragment can be made based on the known 

15 amino acid sequence of KGF using codons preferred by the selected host cell as suggested 
in Urdea et al. 9 Proc. Nail Acad. ScL USA 80: 7461 (1983). Alternatively, the desired 
KGF fragment coding sequence can be cloned from nucleic acid libraries using probes 
based on the nucleic acid sequence shown in [FIGURE X]. Techniques for producing and 
probing nucleic acid sequence libraries are described, for example, in Sambrook et a/., 

20 "Molecular Cloning: A Laboratory Manual" (New York, Cold Spring Harbor Laboratory, 
1989). Other recombinant techniques, such as site specific mutagenesis, PCR, enzymatic 
digestion and ligation, can also be used to construct analogs and derivatives of the coding 
sequence of the KGF fragment. The native KGF polypeptide coding sequence can be 
easily modified to create the other classes of KGF polypeptides. 

25 As an example, analogs of the KGF fragment can be created by conservative 

amino acid substitutions that do not impair the activity of the KGF fragment. The present 
invention further contemplates a subset of analogs of the KGF fragment called muteins, in 
which non-disulfide bond participating cysteines of the KGF fragment are substituted, 
generally, with serines. These muteins may be stable over a broader temperature range 

30 than the unmodified KGF fragment. The muteins herein can also contain amino acid 
deletions compared to the native KGF fragment. Muteins will retain at least 20%, 
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preferably at least 50%, and most preferably at least 80% of the activity of the native 
KGF fragment. The coding sequence of muteins can be constructed by in vitro 
mutagenesis of the coding sequence of the native KGF fragment. 

Fragments differ from the native KGF molecule by amino and/dr carboxyl 
terminal amino acid deletions. The number of amino acids that are truncated is not 
critical as long as the KGF fragment does not contain the N-terminal glycosylation site 
and retains at least 20% of the KGF activity of the native KGF molecule. The coding 
sequence of such fragments can be easily constructed by cleaving the unwanted 
nucleotides from the native KGF coding sequence or a variant thereof. 

An expression vector within the present invention contains a promoter that is 
operable in the host cell and is operably linked to the coding sequence of the KGF 
fragment or analog. An expression vector may optionally include a signal sequence for 
secretion, a terminator, a selectable marker, an origin of replication, and sequences 
homologous to host cell sequences. These additional elements can be included to optimize 
expression. 

Functional non-natural promoters may also be used, for example, synthetic 
promoters based on a consensus sequence of different promoters. Also, effective 
promoters can be hybrid promoters that contain a regulatory region linked with a 
heterologous expression initiation region. Examples of hybrid promoters are the E. coli 
lac operator linked to the E. coli tac transcription activation region; the yeast alcohol 
dehydrogenase ( W ADH") regulatory sequence linked to the yeast glyceraldehyde-3- 
phosphate-dehydrogenase ("GAPDH") transcription activation region as described in U.S. 
Patent Nos. 4,876,197 and 4,880,734, incorporated herein by reference; and the 
cytomegalovirus ("CMV") enhancer linked to the simian virus SV40 promoter. 

The coding sequence for the KGF fragment or analog of the present invention 
may also be linked in reading frame to a signal sequence. The signal sequence typically 
encodes an amino acid sequence for secretion comprised of hydrophobic amino acids that 
direct the KGF fragment or analog to the cell membrane. In a preferred embodiment, a 
processing site is located between the signal sequence and the KGF fragment, to allow 
cleavage between the signal sequence and the KGF fragment either in vivo or in vitro. 
Suitable signal sequences are those derived from genes for secreted endogenous host cell 
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proteins, such as the yeast invertase gene as described in European Patent No. 12873 and 
Japanese Patent No. 62,096,086, the A-factor gene as described in U.S. Patent No. 
4,588,684, and interferon signal sequence as described in EP 60057. 
u v, A preferred class of secretion leaders for use in the present invention for yczst 

5 expression is the truncated yeast alpha-factor leader sequence, which contains at least a 
portion of the "pre" signal sequence, and the "gro" region. The alpha-factor leader 
sequences that can be employed herein include the full-length pre-pro alpha factor leader 
having about 83 amino acid residues, as well as truncated alpha-factor leaders, typically 
about 25 to about 50 amino acid residues, as described in U.S. Patent Nos. 4,546,083 and 
10 and 4,870,008, [and U.S. patent application No. xxxxxxx] and European Patent No. EP 
324274 incorporated herein by reference. Additional leaders employing an alpha-factor 
leader fragment that can be used herein include hybrid alpha-factor leaders made with a 
presequence of a first yeast signal sequence, but a pro-region from a second yeast alpha- 
factor. (See e.g. , PCT WO 89/02463.) [LOOK AT THIS] 

15 

f. Expression systems 

KGF fragments and analogs thereof can be expressed by recombinant techniques 
when a DNA sequence encoding the KGF fragment or analog is operably linked into a 
vector in proper reading frame and orientation, as is well understood by those skilled in 

20 the art. When producing a genetic construct containing the KGF fragment, a preferred 
starting material is cDNA encoding the KGF fragment. Typically, the truncated KGF 
gene will be inserted downstream from a promoter and will be followed by a terminator 
sequence although the KGF fragment may also be produced as a hybrid protein that can be 
cleaved if desired. In general, host-cell-specific sequences improving the production yield 

25 of truncated KGF and truncated KGF polypeptide derivatives will be used and appropriate 
control sequences will be added to the expression vector, such as enhancer sequences, 
polyadenylation sequences, and ribosome binding sites. Once the appropriate coding 
sequence is isolated, it can be expressed in a variety of different expression systems; for 
example those used with mammalian cells, baculoviruses, bacteria, and yeast. These are 

30 discussed in turn below. 
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i. Mammalian systems 

Mammalian expression systems are known in the art. A mammalian promoter is 
any DNA sequence capable of binding mammalian RNA polymerase and initiating the 
downstr (3') transcription of a coding sequence (e.g. structural gene) into mRNA. A 
5 promoter will have a transcription initiating region, which is usually placed proximal to 
the 5' end of the coding sequence, and a TATA box, usually located 25-30 base pairs (bp) 
upstream of the transcription initiation site. The TATA box is thought to direct RNA 
polymerase II to begin RNA synthesis at the correct site. A mammalian promoter will 
also contain an upstream promoter element, typically located within 100 to 200 bp 

10 upstream of the TATA box. An upstream promoter element determines the rate at which 
transcription is initiated and can act in either orientation [Sambrook et al. (1989) 
"Expression of Cloned Genes in Mammalian Cells." In Molecular Cloning: A 
Laboratory Manual, 2nd ed.]. 

Mammalian viral genes are often highly expressed and have a broad host range; 

15 therefore sequences encoding mammalian viral genes provide particularly useful promoter 
sequences. Examples include the SV40 early promoter, mouse mammary tumor vims 
LTR promoter, adenovirus major late promoter (Ad MLP), and herpes simplex virus 
promoter. In addition, sequences derived from non-viral genes, such as the murine 
metallotheionein gene, also provide useful promoter sequences. Expression may be either 

20 constitutive or regulated (inducible), depending on the promoter can be induced with 
glucocorticoid in hormone-responsive cells. 

The presence of an enhancer element (enhancer), combined with the promoter 
elements described above, will typically increase expression levels. An enhancer is a 
regulatory DNA sequence that can stimulate transcription up to 1000-fold when linked to 

25 homologous or heterologous promoters, with synthesis beginning at the normal RNA start 
site. Enhancers are also active when they are placed upstream or downstream from the 
transcription initiation site, in either normal or flipped orientation, or at a distance of 
more than 1000 nucleotides from the promoter [Maniatis et al. (1987) Science 236:1237; 
Alberts et al. (1989) Molecular Biology of the Cell, 2nd ed.]. Enhancer elements derived 

30 from viruses may be particularly useful, because they typically have a broader host range. 
Examples include the SV40 early gene enhancer [Dijkema et al (1985) EMBO J. 4:761] 
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and the enhancer/promoters derived from the long terminal repeat (LTR) of the R us 
Sarcoma Virus [Gorman et al. (1982b) Proc. Natl. Acad, Sci. 79:6777] and from human 
cytomegalovirus [Boshart et al. (1985) Cell 41:521]. Additionally, some enhancers are 
regulatabie and become active only in. -she presence of an inducer, such as a hormone or 
5 metal ion [Sassone-Corsi and Borelli (1986) Trends Genet. 2:215; Maniatis et al. (1987) 
Science 236:1237]. 

A DNA molecule may be expressed intracellularly in mammalian cells. A 
promoter sequence may be directly linked with the DNA molecule, in which case the first 
amino acid at the N-terminus of the recombinant protein will always be a methionine, 

10 which is encoded by the ATG start codon. If desired, the N-terminus may be cleaved 
from the protein by in vitro incubation with cyanogen bromide. 

Alternatively, foreign proteins can also be secreted from the cell into the growth 
media by creating chimeric DNA molecules that encode a fusion protein comprised of a 
leader sequence fragment that provides for secretion of the foreign protein in mammalian 

15 cells. Preferably, there are processing sites encoded between the leader fragment and the 
foreign gene that can be cleaved either in vivo or in vitro . The leader sequence fragment 
typically encodes a signal peptide comprised of hydrophobic amino acids which direct the 
secretion of the protein from the cell. The adenovirus tripartite leader is an example of a 
leader sequence that provides for secretion of a foreign protein in mammalian cells. 

20 Typically, transcription termination and polyadenylation sequences recognized by 

mammalian cells are regulatory regions located 3* to the translation stop codon and thus, 
together with the promoter elements, flank the coding sequence. The 3' terminus of the 
mature mRNA is formed by site-specific post-transcriptional cleavage and polyadenylation 
[Bimstiel et al. (1985) Cell 41:349; Proudfoot and Whitelaw (1988) "Termination and 3' 

25 end processing of eukaryotic RNA. In Transcription and splicing (ed. B.D. Haines and 
D.M. Glover); Proudfoot (1989) Trends Biochem. Sci. 14:105]. These sequences direct 
the transcription of an mRNA which can be translated into the polypeptide encoded by the 
DNA. Examples of transcription terminater/poiyadenylation signals include those derived 
from SV40 [Sambrook et al (1989) "Expression of cloned genes in cultured mammalian 

30 cells." In Molecular Cloning: A Laboratory Manual. 
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S me genes may be expressed more efficiently when introns (also called 
intervening sequences) are present. Several cDNAs, however, have been efficiently 
expressed from vectors that lack splicing signals (also called splice donor and acceptor 
sites) [see e.g., Gothing and Sambrcr.k (1981) Nature 293:620]. Introns are intervening 
noncoding sequences within a coding sequence that contain splice donor and acceptor 
sites. They are removed by a process called "splicing," following polyadenylation of the 
primary transcript [Nevins (1983) Annu. Rev. Biochem. 52:441; Green (1986) Annu. 
Rev. Genet. 20:671; Padgett et al. (1986) Annu. Rev. Biochem. 55:1119; Krainer and 
Maniatis (1988) "RNA splicing." In Transcription and splicing (ed. B.D. Hames and 
D.M. Glover)]. 

Typically, the above described components, comprising a promoter, 
polyadenylation signal, and transcription termination sequence are put together into 
expression constructs. Enhancers, introns with functional splice donor and acceptor sites, 
and leader sequences may also be included in an expression construct, if desired. 
Expression constructs are often maintained in a rep 1 icon, such as an extrachromosomal 
element (e.g., plasmids) capable of stable maintenance in a host, such as mammalian cells 
or bacteria. Mammalian replication systems include those derived from animal viruses, 
which require trans-acting factors to replicate. For example, plasmids containing the 
replication systems of papovaviruses, such as SV40 [Gluzman (1981) Cell 23:175] or 
polyomavirus, replicate to extremely high copy number in the presence of the appropriate 
viral T antigen. Additional examples of mammalian replicons include those derived from 
bovine papillomavirus and Epstein-Barr vims. Additionally, the replicon may have two 
replication systems, thus allowing it to be maintained, for example, in mammalian cells 
for expression and in a procaryotic host for cloning and amplification. Examples of such 
mammalian-bacteria shuttle vectors include pMT2 [Kaufman et al. (1989) Mol. Cell. Biol. 
9:946 and pHEBO [Shimizu et al. (1986) Mol. Cell. Biol. 6:1074]. 

The transformation procedure used depends upon the host to be transformed. 
Methods for introduction of heterologous polynucleotides into mammalian cells are known 
in the art and include dextran-mediated transfection, calcium phosphate precipitation, 
polybrene mediated transfection, protoplast fusion, electroporation, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 
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Mammalian cell lines available as hosts for expression are known in the art and 
include many immortalized cell lines available from the American Type Culture Collection 
(ATCC), including but not limited to, Chinese hamster ovary (CHO) cells, HeLa cells, 
baby hamster kidney (BHK) cells, monkey kidney cells (COS), human hepatocellular 
5 carcinoma cells (e.g., Hep G2), and a number of other cell lines. 

ii. Baculovirus systems 

The polynucleotide encoding KGF can be inserted into a suitable expression 
vector, for example, an insect cell expression vector, and operably linked to control 
elements within that vector. Vector construction employs techniques which are known in 
10 the art. In particular, for the purpose of the present invention, a baculovirus expression 
vector is constructed substantially in accordance to Kitts et al., BioTechniques 14: 810-817 
(1993). 

Briefly, a KGF expression cassette is first constructed by insertion of the KGF !63 
coding sequence into a transfer vector containing a polynucleotide sequence that is 

15 homologous to a portion of the baculovirus genome (referred herein as "the baculovirus 
sequences") and is capable of homolgous recombination therewith. The bacuolvirus 
sequences contains at least an essential polynucleotide sequence, as described in more 
detail below. The KGF coding sequence is inserted into the transfer vector in such a 
manner that it is flanked on both ends by the baculovirus sequences. 

20 The transfer vector containing the KGF coding sequence is transfected into the 

host insect cells together with a mutant of wild-type baculovirus that lacks an essential 
polynucleotide sequence necessary for production of a functional virus. In this regard, a 
functional virus can be produced in the host cells upon transfection when the mutant 
baculovirus recombines with the KGF expression cassette. 

25 The functional baculovirus produced in this manner, therefore, incorporates the 

KGF expression cassette and is suitable for transfection into new host cells for production 
of recombinant KGF and recombinant KGFaa,.^. Alternatively, this process can be 
conducted with a KGF^,.^ coding sequence instead of a KGF, 63 coding sequence. 

The functional baculovirus Generally, the components of the expression system 

30 include a transfer vect r, usually a bacterial plasmid, which contains both a fragment of 
the baculovirus genome, and a convenient restriction site for insertion of the heterologous 
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gene r genes to be expressed; a wild type baculovims with a sequence homologous to the 
baculovims-specific fragment in the transfer vector (this allows for the homologous 
recombination of the heterologous gene in to the baculovims genome); and appropriate 
insect host cells and growth media. 

After inserting the truncated KGF DNA sequence into the transfer vector, the 
vector and the wild type viral genome are transfected into an insect host cell where the 
vector and viral genome are allowed to recombine. The packaged recombinant virus is 
expressed and recombinant plaques are identified and purified. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, 
inter alia . Invitrogen, San Diego CA ("MaxBac" kit). These techniques are generally 
known to those skilled in the art and fully described in Summers and Smith, Texas 
Agricultural Experiment Station Bulletin No. 1555 (1987) (hereinafter "Summers and 
Smith"), and incorporated by reference. 

Prior to inserting the truncated KGF DNA sequence into the baculovirus genome, 
the above described components, comprising a promoter, leader (if desired), coding 
sequence of interest, and transcription termination sequence, are typically assembled into 
an intermediate displacement construct (transfer vector). This construct may contain a 
single gene and operably linked regulatory elements; multiple genes, each with its owned 
set of operably linked regulatory elements; or multiple genes, regulated by the same set of 
regulatory elements. Intermediate transplacement constructs are often maintained in a 
replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable 
maintenance in a host, such as a bacterium. The replicon will have a replication system, 
thus allowing it to be maintained in a suitable host for cloning and amplification. 

Currently, the most commonly used transfer vector for introducing foreign genes 
into AcNPV is pAc373. Many other vectors, known to those of skill in the art, have also 
been designed. These include, for example, pVL985 (which alters the polyhedrin start 
codon from ATG to ATT, and which introduces a BamHI cloning site 32 basepairs 
downstream from the ATT; see Luckow and Summers, Virology (1989) 17:31. 

The plasmid usually also contains the polyhedrin polyadenylation signal (Miller et 
al. (1988) Ann. Rev. Microbiol., 42:177) and a procaryotic ampicillin-resistance (amp) 
gene and origin of replication for selection and propagation in JL £Qli- 
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Baculovims transfer vectors usually contain a baculovirus promoter. A 
baculovirus promoter is any DNA sequence capable of binding a baculovirus RNA 
polymerase and initiating the downstream (5' to 3') transcription of a coding sequence 
(e.g. structural gene) into mRNA. A promoter will have a transcription initiation region 
5 which is usually placed proximal to the 5* end of the coding sequence. This transcription 
initiation region typically includes an RNA polymerase binding site and a transcription 
initiation site. A baculovirus transfer vector may also have .a second domain called an 
enhancer, which, if present, is usually distal to the structural gene. Expression may be 
either regulated or constitutive. 

10 Structural genes, abundantly transcribed at late times in a viral infection cycle, 

provide particularly useful promoter sequences. Examples include sequences derived from 
the gene encoding the viral polyhedron protein, Friesen et al., (1986) "The Regulation of 
Baculovirus Gene Expression," in: The Molecular Biology of Baculovimses (ed. Walter 
Doerfler); EPO Pub. Nos. 127,839 and 155,476; and the gene encoding the plO protein 

15 Vlak et al., (1988), J. Gen. Virol. 69:765. 

DNA encoding suitable signal sequences can be derived from genes for secreted 
insect or baculovirus proteins, such as the baculovirus polyhedrin gene (Carbonell et al. 
(1988) Gene, 73:409). Alternatively, since the signals for mammalian cell 
posttranslational modifications (such as signal peptide cleavage, proteolytic cleavage, and 

20 phosphorylation) appear to be recognized by insect cells, and the signals required for 
secretion and nuclear accumulation also appear to be conserved between the invertebrate 
cells and vertebrate cells, leaders of non-insect origin, such as those derived from genes 
encoding human a-interferon, Maeda et al., (1985), Nature 315:592; human gastrin- 
releasing peptide, Lebacq-Verheyden et al., (1988), Molec. Ceil. Biol. 8:3129; human IL- 

25 2, Smith et al., (1985) Proc. Nat'l Acad. Sci. USA, 82:8404; mouse IL-3, (Miyajima et 
al M (1987) Gene 58:273; and human glucocerebrosidase, Martin et al. (1988) DNA, 7:99, 
can also be used to provide for secretion in insects. 

A recombinant polypeptide or polyprotein may be expressed intracellularly or, if 
it is expressed with the proper regulatory sequences, it can be secreted. Good 

30 intracellular expression of nonfused foreign proteins usually requires heterologous genes 
that ideally have a short leader sequence containing suitable translation initiation signals 
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preceding an ATG start signal. If desired, methionine at the N-terminus may be cleaved 
from the mature protein by in vitro incubation with cyanogen bromide. 

Alternatively, recombinant polyproteins or proteins which are not naturally 
secreted can be secreted from the insect cell by creating chimeric DNA molecules that 
encode a fusion proton comprised of a leader sequence fragment that provides for 
secretion of the foreign protein in insects. The leader sequence fragment typically 
encodes a signal peptide comprised of hydrophobic amino acids which direct the 
translocation of the protein into the endoplasmic reticulum. 

After insertion of the truncated KGF DNA sequence and/or the gene encoding the 
expression product precursor, an insect cell host is co-transformed with the heterologous 
DNA of the transfer vector and the genomic DNA of wild type baculovirus — usually by 
co-transfection. The promoter and transcription termination sequence of the construct will 
typically comprise a 2-5kb section of the baculovirus genome. Methods for introducing 
heterologous DNA into the desired site in the baculovirus virus are known in the art. 
(See Summers and Smith supra : Ju et al. (1987); Smith et aL, Mol. CelL Biol. (1983) 
3:2156; and Luckow and Summers (1989)). For example, the insertion can be into a gene 
such as the polyhedrin gene, by homologous double crossover recombination; insertion 
can also be into a restriction enzyme site engineered into the desired baculovirus gene. 
Miller et al., (1989), Bioessays 4:91. The DNA sequence, when cloned in place of the 
polyhedrin gene in the expression vector, is flanked both 5' and 3* by polyhedrin-specific 
sequences and is positioned downstream of the polyhedrin promoter. 

The newly formed baculovirus expression vector is subsequently packaged into an 
infectious recombinant baculovirus. Homologous recombination occurs at low frequency 
(between about 1% and about 5%); thus, the majority of the virus produced after 
cotransfection is still wild-type virus. Therefore, a method is necessary to identify 
recombinant viruses. An advantage of the expression system is a visual screen allowing 
recombinant viruses to be distinguished. The polyhedrin protein, which is produced by 
the native virus, is produced at very high levels in the nuclei of infected cells at late times 
after viral infection. Accumulated polyhedrin protein forms occlusion bodies that also 
contain embedded particles. These occlusion bodies, up to 15 /xm in size, are highly 
refractile, giving them a bright shiny appearance that is readily visualized under the light 
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microscope. Cells infected with recombinant viruses lack occlusion bodies. To 
distinguish recombinant virus from wild-type virus, the transfection supernatant is plaqued 
onto a monolayer of insect cells by techniques known to those skilled in the art. Namely, 
the plaques are screened under the light microscope for the presence (indicative of wild- 
5 type virus) or absence (indicative of recombinant virus) of occlusion bodies. "Current 
Protocols in Microbiology" Vol. 2 (Ausubel et ah eds) at 16.8 (Supp. 10, 1990); 
Summers and Smith, supra : Miller et al. (1989). 

Recombinant baculovirus expression vectors have been developed for infection 
into several insect cells. For example, recombinant baculovi ruses have been developed 
10 for, inter alia: Aedes aepypti , Autographa californica . Bombvx mori . Drosophila 

melanogaster. Soodoptera frugiperda . and Trichoplusia ni (PCT Pub. No. WO89/046699; 
Carbonell et al., (1985) J. Virol. 56:153; Wright (1986) Nature 221:718; Smith et al., 
(1983) Mol, Qe\h B\q} t 2:2156; and see generally, Fraser, sUL (1989) In Vitro Cell. 
Ds^BioL 25:225). 

15 Cells and cell culture media are commercially available for both direct and fusion 

expression of heterologous polypeptides in a baculovirus/expression system; cell culture 
technology is generally known to those skilled in the art. See, e.g. . Summers and Smith 
supra . 

The modified insect cells may then be grown in an appropriate nutrient medium, 
20 which allows for stable maintenance of the plasmid(s) present in the modified insect host. 
Where the expression product gene is under inducible control, the host may be grown to 
high density, and expression induced. Alternatively, where expression is constitutive, the 
product will be continuously expressed into the medium and the nutrient medium must be 
continuously circulated, while removing the product of interest and augmenting depleted 
25 nutrients. The product may. be purified by such techniques as chromatography, e.g., 
HPLC, affinity chromatography, ion exchange chromatography, etc.; electrophoresis; 
density gradient centrifugation; solvent extraction, or the like. As appropriate, the 
product may be further purified, as required, so as to remove substantially any insect 
proteins which are also secreted in the medium or result from lysis of insect cells, so as to 
30 provide a product which is at least substantially free of host debris, e.g., proteins, lipids 
and polysaccharides. 
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In rder to obtain truncated KGF expression, recombinant host cells derived from 
the transformants are incubated under conditions which allow expression of the 
recombinant truncated KGF encoding sequence. These conditions will vary, dependent 
upon the host cell selected. However, the conditions are readily ascertainable to those of 
ordinary skill in the art, based upon what is known in the art. 

iii. Bacterial systems 

Bacterial expression techniques are known in the art. A bacterial promoter is any 
DNA sequence capable of binding bacterial RNA polymerase and initiating the 
downstream (3") transcription of a coding sequence (e.g. structural gene) into mRNA. A 
promoter will have a transcription initiation region which is usually placed proximal to the 
5' end of the coding sequence. This transcription initiation region typically includes an 
RNA polymerase binding site and a transcription initiation site. A bacterial promoter may 
also have a second domain called an operator, that may overlap an adjacent RNA 
polymerase binding site at which RNA synthesis begins. The operator permits negative 
regulated (inducible) transcription, as a gene repressor protein may bind the operator and 
thereby inhibit transcription of a specific gene. Constitutive expression may occur in the 
absence of negative regulatory elements, such as the operator. In addition, positive 
regulation may be achieved by a gene activator protein binding sequence, which, if 
present is usually proximal (5*) to the RNA polymerase binding sequence. An example of 
a gene activator protein is the catabolite activator protein (CAP), which helps initiate 
transcription of the lac operon in Escherichia coli (E. coli) [Raibaud et al. (1984) Annu. 
Rev. Genet. 18:173]. Regulated expression may therefore be either positive or negative, 
thereby either enhancing or reducing transcription. 

Sequences encoding metabolic pathway enzymes provide particularly useful 
promoter sequences. Examples include promoter sequences derived from sugar 
metabolizing enzymes, such as galactose, lactose (lac) [Chang ei al. (1977) Nat u re 
122:1056], and maltose. Additional examples include promoter sequences derived from 
biosynthetic enzymes such as tryptophan (trg) [Goeddel et al. (1980) Nuc. Acids Res. 
8:4057; Yelverton §t al. (1981) Nucl. Acids Res. 2:731; U.S. 4,738,921; EPO Pub. 
Nos. 036 776 and 121 775]. The g-laotamase (Ma) promoter system [Weissmann (1981) 
"The cloning of interferon and other mistakes." In Interferon 3 (ed. I. Gresser)], 



WO 95/01434 



PCT/US94/04694 



-25- 



bacteriophage lambda PL [Shimatake et al. (1981) Nature 292:128] and T5 
[U.S. 4,689,406] promoter systems also provide useful promoter sequences. 

In addition, synthetic promoters which do not occur in nature also function as 
bacterial promoters. For example, transcription activation sequences of one bacterial or 
5 bacteriophage promoter may be joined with the operon sequences of another bacterial or 
bacteriophage promoter, creating a synthetic hybrid promoter [U.S. Patent 
No. 4,551^433]. For example, the tag promoter is a hybrid trp-lac promoter comprised of 
both £e promoter and lac operon sequences that is regulated by the lag repressor [Amann 
fit al. (1983) Gene 25:167; de Boer §t al. (1983) Proc. Nad. Acad. Sci. 3Q:21]. 

10 Furthermore, a bacterial promoter can include naturally occurring promoters of non- 
bacterial origin that have the ability to bind bacterial RNA polymerase and initiate 
transcription. A naturally occurring promoter of non-bacterial origin can also be coupled 
with a compatible RNA polymerase to produce high levels of expression of some genes in 
prokaryotes. The bacteriophase T7 RNA polymerase/promoter system is an example of a 

15 coupled promoter system [Studier et aL (1986) J, Mol. Biol. 189 : 1 13; Tabor e| al- (1985) 
Proc Natl. Acad. ScL 82:1074]. In addition, a hybrid promoter can also be comprised of 
a bacteriophage promoter and an IL coli operator region (EPO Pub. No. 267 851), 

In addition to a functioning promoter sequence, an efficient ribosome binding site 
is also useful for the expression of foreign genes in prokaryotes. In E. coli . the ribosome 

20 binding site is called the Shine-Dalgarno (SD) sequence and includes an initiation codon 
(ATG) and a sequence 3-9 nucleotides in length located 3-11 nucleotides upstream of the 
initiation codon [Shine et al. (1975) Nature 254:34]. The SD sequence is thought to 
promote binding of mRNA to the ribosome by the pairing of bases between the SD 
sequence and the 3* and of E. coH 16S rRNA [Steitz & aL (1979) "Genetic signals and 

25 nucleotide sequences in messenger RNA" In Biological Regulation and Development: 
Gene Expression (ed. R.F. Goldberger)]. To express eukaryotic genes and prokaryotic 
genes with weak ribosome-binding site [Sambrook e| al. (1989) "Expression of cloned 
genes in Escherichia coli" In Molecular Cloning: A Laboratory Manual . 

A DNA molecule may be expressed intracellular^ . A promoter sequence may be 

30 directly linked with the DNA m lecule, in which case the first amino acid at the N- 
terminus will always be a methionine, which is encoded by the ATG start codon. If 
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desired, methionine at the N-terminus may be cleaved from the protein by in vitro 
incubation with cyanogen bromide or by either in vivo on in vitro incubation with a 
bacterial methionine N-terminal peptidase (EPO Pub. No. 219 237). 

Fusion proteins provide an alternative to direct expression. Typically, a DNA 
5 sequence encoding the N-terminal portion of an endogenous bacterial protein, or other 
stable protein, is fused to the 5' end of heterologous coding sequences. Upon expression, 
this construct will provide a fusion of the two amino acid sequences. For example, the 
bacteriophage lambda cell gene can be linked at the 5 1 terminus of a foreign gene and 
expressed in bacteria. The resulting fusion protein preferably retains a site for a 

10 processing enzyme (factor Xa) to cleave the bacteriophage protein from the foreign gene 
[Nagai si al* (1984) Nature 202*810]. Fusion proteins can also be made with sequences 
from iagZ [Jia e| al. (1987) Gene 60:197]; fapg [Allen et al. (1987) J. Biotechnol. 5:93; 
Makoff et §1. (1989) J. Gen. Microbiol. 125:11]; and Chev [EPO Pub. No. 324 647] 
genes. The DNA sequence at the junction of the two amino acid sequences may or may 

15 not encode a cleavable site. Another example is a ubiquitin fusion protein. Such a fusion 
protein is made with the ubiquitin region that preferably retains a site for a processing 
enzyme (e.g. ubiquitin specific processing-protease) to cleave the ubiquitin from the 
foreign protein. Through this method, native foreign protein can be isolated [Miller el gl. 
(1989) Bio/Technology 7:698]. 

20 Alternatively, foreign proteins can also be secreted from the cell by creating 

chimeric DNA molecules that encode a fusion protein comprised of a signal peptide 
sequence fragment that provides for secretion of the foreign protein in bacteria [U.S. 
4,336,336]. The signal sequence fragment typically encodes a signal peptide comprised of 
hydrophobic amino acids which direct the secretion of the protein from the cell. The 

25 protein is either secreted into the growth media (gram-positive bacteria) or into the 
periplasmic space, located between the inner and outer membrane of the cell (gram- 
negative bacteria). Preferably there are processing sites, which can be cleaved either in 
vivo or in vitro encoded between the signal peptide fragment and the foreign gene. 

DNA encoding suitable signal sequences can be derived from genes for secreted 

30 bacterial proteins, such as the E. coli outer membrane protein gene (ompAl [Masui et al. 
(1983), in: Experimental Manipulation of Gene Expression : Ghrayeb et al. (1984) EMBO 
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L 2:2437] and the £. coii alkaline phosphatase signal sequence (phoA> [Oka et al. (1985) 
Proc. Natl, Acad. Sci. £2:7212]. As an additional example, the signal sequence of the 
alpha-amyiase gene from various Bacilus strains can be used to secrete heterologous 
proteins from El syMUs [Palva st al-' (1982) Proc. Natl. Acad. Sci. USA 22:5582; EPO 
5 Pub. No. 244 042]. 

Typically, transcription termination sequences recognized by bacteria are 
regulatory regions located 3 1 to the translation stop cod on, and thus together with the 
promoter flank the coding sequence. These sequences direct the transcription of an 
mRNA which can be translated into the polypeptide encoded by the DNA. Transcription 

10 termination sequences frequently include DNA sequences of about 50 nucleotides capable 
of forming stem loop structures that aid in terminating transcription. Examples include 
transcription termination sequences derived from genes with strong promoters, such as the 
fig gene in E. coli as well as other biosynthetic genes. 

Typically, the above described components, comprising a promoter, signal 

15 sequence (if desired), coding sequence of interest, and transcription termination sequence, 
are put together into expression constructs. Expression constructs are often maintained in 
a rcplicon, such as an extrachromosomal element (e.g., plasmids) capable of stable 
maintenance in a host, such as bacteria. The replicon will have a replication system, thus 
allowing it to be maintained in a procaryotic host either for expression or for cloning and 

20 amplification. In addition, a rcplicon may be either a high or low copy number plasmid. 
A high copy number plasmid will generally have a copy number ranging from about 5 to 
about 200, and typically about 10 to about 150. A host containing a high copy number 
plasmid will preferably contain at least about 10, and more preferably at least about 20 
plasmids. Either a high or low copy number vector may be selected, depending upon the 

25 effect of the vector and the foreign protein on the host. 

Alternatively, the expression constructs can be integrated into the bacterial 
genome with an integrating vector. Integrating vectors typically contain at least one 
sequence homologous to the bacterial chromosome that allows the vector to integrate. 
Integrations appear to result from recombinations between homologous DNA in the vector 

30 and the bacterial chromosome. For example, integrating vectors constructed with DNA 
from various Bacillus strains integrate into the Bacillus chromosome (EPO Pub. No. 127 
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328). Integrating vectors may also be comprised of bacteriophage or transposon 
sequences. 

Typically, extrachromosomal and integrating expression constructs may contain 
selectable m? ':crs to allow for the selection of bacterial strains that have been 
transformed. Selectable markers can be expressed in the bacterial host and may include 
genes which render bacteria resistant to drugs such as ampicillin, chloramphenicol, 
erythromycin, kanamycin (neomycin), and tetracycline [Davies £i &[. (1978) Annu. 
Rev.MicrobioL 22:469]. Selectable markers may also include biosynthetic genes, such as 
those in the histidine, tryptophan, and leucine biosynthetic pathways. 

Alternatively, some of the above described components can be put together in 
transformation vectors. Transformation vectors are typically comprised of a selectable 
market that is either maintained in a replicon or developed into an integrating vector, as 
described above. 

Expression and transformation vectors, either extra-chromosomal replicons or 
integrating vectors, have been developed for transformation into many bacteria. For 
example, expression vectors have been developed for, inter alia , the following bacteria: 
Bacillus subtilis [Palva & al- (1982) Proc. Natl. Acad. Sci. USA 22:5582; EPO Pub. Nos. 
036 259 and 063 953; PCT WO 84/04541]; Escherichia coli [Shimatake ei aL (1981) 
Nature 222: 128; Amann ej al. (1985) 4Q:183; Studier & ai. (1986) J. Mol. Biol. 
189:113; EPO Pub. Nos. 036 776, 136 829 and 136 907]; Streptococcus cremoris [Powell 
§1 al- (1988) A ppl. Enviro n. Microbiol. 54:655]; Streptococcus ljvidans [Powell & ai. 
(1988) A ppl. Environ . Microbiol, 54:655]; Stfeptomyces |ivjdans [U.S. 4,745,056]. 

Methods of introducing exogenous DNA into bacterial hosts are well-known in the 
art, and typically include either the transformation of bacteria treated with CaCL, or other 
agents, such as divalent cations and DMSO. DNA can also be introduced into bacterial 
cells by electroporation. Transformation procedures usually vary with the bacterial 
species to be transformed. See e.g., [Masson ej aL H989V FEMS Microbiol. Lett. 
$Q:273; Palva ei M- (1982) Proc. Natl. Acad. Sci. USA 79:5582; EPO Pub. Nos. 036 259 
and 063 953; PCT Publication No. WO84/04541, Bacillus! . [Miller & al. (1988) Proc. 
Natl. Acad. Sci. 85:856; Wang si al. (1990) J. Bacteriol. 172:949, Campylobacter! : 
[Cohen et al- (1973) Proc. Natl. Acad. Sci. 69:2110: Dower el al. (1988) Nucleic Acids 
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fiSS*. 16:6127; Kushner (1978) "An improved method for transformation of Escherichia 
coli with ColEl-derived plasmids. In Genetic Engin eering: Proceedings of the 
International Symposium on Genetic Engineering (eds. H.W. Boyer and S. Nicosia); 
Mandel §1 at (1970) J. Mol. Biol. 52:159; Taketo (1988) Biochim. Bionhw Ar^ 
5 242:318; Escherichia!: [Chassy ej al. (1987) FEMS Microbiol. Lett. 44: 173 

LaetebacHlus]; [Fiedler el (1988) Anal. Biochem 170:38, Pseudomonasl : [Augustin et 
ii- (1990) FEMS Microbiol. Lett. £6:203, Staph vlococcusl : rRarany el al. (1980) L 
BactgriQl, 1M:698; Harlander (1987) "Transformation of Streptococcus lactis by 
electroporation, in: Streptococcal Genetic (ed. J. Ferretti and R. Curtiss III); Perry et al. 
10 (1981) Ipfec. Immun. 32: 1295; Powell et al. (1988) ApdI. Environ. Microbiol. 54:655; 
Somkuti et ai- (1987) Proc. 4th Evr. Cong. Biotechnology i:412, Streptococcus !, 
iv. Yeast expression 

Yeast expression systems are also known to one of ordinary skill in the art A 
yeast promoter is any DNA sequence capable of binding yeast RNA polymerase and 

15 initiating the downstream (3*) transcription of a coding sequence (e.g. structural gene) into 
mRNA. A promoter will have a transcription initiation region which is usually placed 
proximal to the 5* end of the coding sequence. This transcription initiation region 
typically includes an RNA polymerase binding site (the TATA Box") and a transcription 
initiation site. A yeast promoter may also have a second domain called an upstream 

20 activator sequence (UAS), which, if present, is usually distal to the structural gene. The 
UAS permits regulated (inducible) expression. Constitutive expression occurs in the 
absence of a UAS. Regulated expression may be either positive or negative, thereby 
either enhancing or reducing transcription. 

Yeast is a fermenting organism with an active metabolic pathway, therefore 

25 sequences encoding enzymes in the metabolic pathway provide particularly useful 

promoter sequences. Examples include alcohol dehydrogenase (ADH)(EPO Pub. No. 284 
044), enolase, glucokinase, glucose-6-phosphate isomerase, glyceraldehyde-3-phosphate- 
dehydrogenase (GAP or GAPDH), hexokinase, phosphofructokinase, 3-phosphoglycerate 
mutase, and pyruvate kinase (PyK) (EPO Pub. No. 329 203). The yeast PHQ5 gene, 

30 encoding acid phosphatase, also provides useful promoter sequences [Myanohara et al. 
(1983) Proc. Natl. Ac ad. Sci. USA 8Q: 1]. 
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In addition, synthetic promoters which do n t occur in nature also function as 
yeast promoters. For example, UAS sequences of one yeast promoter may be joined with 
the transcription activation region of another yeast promoter, creating a synthetic hybrid 
promoter. Examples of such hybrid promoters include the ADH regulatory sequence 
linked to the GAP transcription activation region (U.S. 4,876,197 and U.S. 4,880,734). 
Other examples of hybrid promoters include promoters which consist of the regulatory 
sequences of either the ADH2 . GAL4 . GAL10 . or PHOS genes, combined with the - 
transcriptional activation region of a glycolytic enzyme gene such as GAP or PyK (EPO 
Pub. No. 164 556). Furthermore, a yeast promoter can include naturally occurring 
promoters of non-yeast origin that have the ability to bind yeast RNA polymerase and 
initiate transcription. Examples of such promoters include, inter alia . [Cohen et al. (1980) 
Proc. Natl. Acad. Sci. USA 72:1078; Henikoff & al. (1981) Nature 222:835; Hollenberg 
e$ al. (1981) Curr. Topics Microbiol. Immunol. 26:119; Hollenberg et al. (1979) "The 
Expression of Bacterial Antibiotic Resistance Genes in the Yeast Saccharomvces 
cerevisiae." in: Plasmids of Medical. Environmental and Commercial Importance (eds. 
K.N. Timmis and A. Puhler); Mercerau-Puigalon e| ah (1980) Gene 11:163; Panthier et 
al. (1980) Curr. Genet. 2:109]. 

A DNA molecule may be expressed intracellularly in yeast. A promoter sequence 
may be directly linked with the DNA molecule, in which case the first amino acid at the 
N-terminus of the recombinant protein will always be a methionine, which is encoded by 
the ATG start codon. If desired, methionine at the N-terminus may be cleaved from the 
protein by in vitro incubation with cyanogen bromide. 

Fusion proteins provide an alternative for yeast expression systems, as well as in 
mammalian, baculovirus, and bacterial expression systems. Typically, a DNA sequence 
encoding the N-terminal portion of an endogenous yeast protein, or other stable protein, is 
fused to the 5' end of heterologous coding sequences. Upon expression, this construct 
will provide a fusion of the two amino acid sequences. For example, the yeast or human 
superoxide dismutase (SOD) gene, can be linked at the 5' terminus of a foreign gene and 
expressed in yeast. The DNA sequence at the junction of the two amino acid sequences 
may or may not encode a cleavable site. See e.g., EPO Pub. No. 196 056- Another 
example is a ubiquitin fusion protein. Such a fusion protein is made with the ubiquitin 



WO 95/01434 



PCT/US94/04694 



-31 - 



region that preferably retains a site for a processing enzyme (e.g. ubiquitin-specific 
processing protease) to cleave the ubiquitin from the foreign protein. Through this 
method, therefore, native foreign protein can be isolated (see, e.g., PCT Publ. No. 
WO88/024066). 

5 Alternatively, foreign proteins can also be secreted from the cell into the growth 

media by creating chimeric DNA molecules that encode a fusion protein comprised of a 
leader sequence fragment that provide for secretion in yeast of the foreign protein. 
Preferably, there are processing sites encoded between the leader fragment and the foreign 
gene that can be cleaved either in vivo or in vitro . The leader sequence fragment 

10 typically encodes a signal peptide comprised of hydrophobic amino acids which direct the 
secretion of the protein from the cell. 

DNA encoding suitable signal sequences can be derived from genes for secreted 
yeast proteins, such as the yeast invertase gene (EPO Pub. No. 012 873; JPO Pub. No. 
62,096,086) and the A-factor gene (U.S. 4,588,684). Alternatively, leaders of non-yeast 

15 origin, such as an interferon leader, exist that also provide for secretion in yeast (EPO 
Pub. No. 060 057). 

A preferred class of secretion leaders are those that employ a fragment of the 
yeast alpha factor gene, which contains both a "pre" signal sequence, and a "pro" region. 
The types of alpha factor fragments that can be employed include the full-length pre-pro 

20 alpha factor leader (about 83 amino acid residues) as well as truncated alpha-factor leaders 
(typically about 25 to about 50 amino acid residues) (U.S. 4,546,083 and U.S. 4,870,008; 
EPO Publ. No. 324 274). Additional leaders employing an alpha-factor leader fragment 
that provides for secretion include hybrid alpha factor leaders made with a presequence of 
a first yeast, but a pro-region from a second yeast alpha factor. See e.g., PCT Publ. No. 

25 WO89/02463. 

Typically, transcription termination sequences recognized by yeast are regulatory 
regions located 3' to the translation stop codon, and thus together with the promoter flank 
the coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Examples of transcription terminator 
30 sequence and other yeast-recognized termination sequences, such as those coding for 
glycolytic enzymes. 
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Typically, the above described components, comprising a promoter, leader (if 
desired), coding sequence of interest, and transcription termination sequence, are put 
together into expression constructs. Expression constructs are often maintained in a 
repiicon, such as an extrachromosomal element (e.g., plasmids) capable of stable 
5 maintenance in a host, such as yeast or bacteria. The repiicon may have two replication 
systems, thus allowing it to be maintained, for example, in yeast for expression and in a 
procaryotic host for cloning and amplification. Examples of such yeast-bacteria shuttle 
vectors include YEp24 [Botstein ej al. (1979) Gene 8:17-24]; pCI/1 [Brake £| al. (1984) 
Pro. Natl, Acad. Spi USA £1:4642-4646]; and YRpl7 [Stinchcomb 3 al. (1982) J. Mol. 

10 Biol. 158:157]. In addition, a repiicon may be either a high or low copy number plasmid. 
A high copy number plasmid will generally have a copy number ranging from about 5 to 
about 200, and typically about 10 to about 150. A host containing a high copy number 
plasmid will preferably have at least about 10, and more preferably at least about 20. 
Enter a high or low copy number vector may be selected, depending upon the effect of the 

15 vector and the foreign protein on the host See e.g., Brake e| al., supra . 

Alternatively, the expression constructs can be integrated into the yeast genome 
with an integrating vector. Integrating vectors typically contain at least one sequence 
homologous to a yeast chromosome that allows the vector to integrate, and preferably 
contain two homologous sequences flanking the expression construct. Integrations appear 

20 to result from recombinations between homologous DNA in the vector and the yeast 
chromosome [Orr- Weaver gt al. (1983) Methods in Enzvmol. 101:228-245]. An 
integrating vector may be directed to a specific locus in yeast by selecting the appropriate 
homologous sequence for inclusion in the vector. See Orr-Weaver et ah, supra . One or 
more expression construct may integrate, possibly affecting levels of recombinant protein 

25 produced [Rine e£ al. (1983) Proc. Natl. Acad. Sci. USA 80:6750]. The chromosomal 
sequences included in the vector can occur either as a single segment in the vector, which 
results in the integration of the entire vector, or two segments homologous to adjacent 
segments in the chromosome and flanking the expression construct in the vector, which 
can result in the stable integration of only the expression construct. 

30 Typically, extrachromosomal and integrating expression constructs may contain 

selectable markers to allow for the selection of yeast strains that have been transformed. 



« 
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Selectable markers may include biosynthetic genes that can be expressed in the yeast host, 
such as APE?, HIS4 . LEU2 . TRP1 . and AL2Z, and the G418 resistance gene, which 
confer resistance in yeast cells to tunicamycin and G418 . respectively. In addition, a 
suitable selectable marker may also provide yeast with the ability to grow in the presence 
5 of toxic compounds, such as metal. For example, the presence of CUP1 allows yeast to 
grow in the presence of copper ions [Butt fit (1987) Microbiol. Rev. 51:351]. 

Alternatively, some of the above described components can be put together into 
transformation vectors. Transformation vectors are typically comprised of a selectable 
marker that is either maintained in a replicon or developed into an integrating vector, as 

10 described above. 

Expression and transformation vectors, either extrachromosomal replicons or 
integrating vectors, have been developed for transformation into many yeasts. For 
example, expression vectors have been developed for, inter alia , the following yeasts: 
Candida albicans [Kurtz, et al (1986) Mol. Cell. BioL 6:142]; Candida mahosa (Kunze, 

15 et al. 91985) J. Basic Microbiol. 25:141]; Hansenula polvmorpha [Gleeson, et al. (1986) 
J. Gen. Microbiol 122:3459; Roggenkamp et al. (1986) Mol. Gen. Genet. 202:302]; 
Kluweromvces frapilis [Das, et al. (1984) J. Bacterid. 13:1165]; Kluweromvces lactis 
[De Louvencourt et al. (1983) h Pac te ri o U 154:737; Van den Berg fit al. (1990) 
Pip/Technplogy 8:135]; Pichia guillerimondii [Kunze et al- (1985) J. Basic Microbiol. 

2 0 25:141]; Pichi^ pastoris [Cregg, gi al. (1985) Mol. Cell. Biol. 5:3376; U.S. 4,837,148 
and U.S. 4,929,555]; Saccharomvces cerevisiae [Hinnen fit al. (1978) Proc. Natl. Acad, 
gcj t USA 25:1929; Itoetal (1983) J. Bacteriol. 153:1631: Schizosaccharomvces pombe 
[Beach and Nurse (1981) Nature 2SQ:706]; and Yarrowia lipolvtica [Davidow, et al. 
(1985) Curr. Genet. 10:380471; Gaillardin, et al. (1985) Curr. Genet. 10:49]. 

25 Methods of introducing exogenous DNA into yeast hosts are well-known in the 

art, and typically include either the transformation of spheroplasts or of intact yeast cells 
treated with alkali cations. Transformation procedures usually vary with the yeast species 
to be transformed. See e.g., [Kurtz et al. (1986) Mol. Cell. Biol. 6:142; Kunze fit al- 
91985) J, B^ic MiprQfopl, 25:141, Candidal : [Gleeson et al. 91986) J. Gen. Microbiol. 

30 122:3459; Roggenkamp et al. (1986) Mol. Gen. Genet. 202:302, Hansenula l: [Das et al. 
(1984) J. Bacteriol. 158:1165; De Louvencourt et al. (1983) J. Bacteriol. 154:1165; Van 
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den Berg §1 al (1990) Bio/Technology j|:135, Kluweromvcesl : [Cregg gj al. (1985) Mol. 
Cell. Biol. 5:3376; Kunze et al. (1985) J. Basic Microbiol. 23:141; U.S. 4,837,148 and 
U.S. 4,929,555, Pichial : [Hinnen et al. (1978) Proc. Natl. Acad. Sci. USA 7$; 1929; Ito 
gt al. (1983) J. Bacteriol. 152:163, SareharqpiyresV, [Beach and Nurse (1981) N^re 
2QQ:706, Schizosaccharomvcesl : [Davidow e£ al- (1985) Qm, Q^e\ t 10:39; Gaillardin et 
al. (1985) Cm. Gem& 1Q:49, Yarrpwia]. 

In the present invention, selectable markers, an origin of replication, and 
homologous host ceiis sequences may be included in an expression vector. A selectable 
marker can be used to screen for host cells that potentially contain the expression vector. 
Such markers include those that render a host cell resistant to drugs such as ampicillin, 
chloramphenicol, erythromycin, neomycin, and tetracycline. Also, the markers may 
include biosynthetic genes, such as those in the histidine, tryptophan, and leucine 
pathways that are required foe growth of the host cell. Thus, when a leu(-) host cell is 
used a recipient in transformation with an expression vector, and leucine is absent from 
the media, for example, only the cells that carry a plasmid with a leu(+) gene will 
survive. 

In the present invention, an origin of replication may be incorporated into an 
expression vector to allow autonomous replication in the host cell. Such origin of 
replication includes those that enable an expression vector to be reproduced at a high copy 
number in the presence of the appropriate proteins within the cell, for example the 2fi and 
autonomously replicating sequences, that are effective in yeast; and the origin of 
replication of the viral T-antigen, that is effective in COS-7 cells. 

For the purpose of the present invention, expression vectors may either be 
integrated into the host cell genome or remain autonomous within the cell. For 
integration into the host genome, the expression vector herein may include polynucleotide 
sequences that are homologous to sequences within the host cell genome. The 
homologous sequences need not to linked to the expression vector. [ASK IF THIS IS 
TRUE For example, expression vectors can integrate into the CHO genome via an 
unattached dihydrofolate reductase gene.] In yeast, it is preferred that the homologous 
sequences flank the expression cassette. Particularly useful homologous yeast genome 
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sequences for the present invention are those disclosed in PCT WO90/01800, and the 
HIS4 gene sequences, described in Genbank, accession no. J01331. 

The choice of promoter, terminator, and other optional elements of an expression 
vector will also depend on the host cell chosen as is known to a person of ordinary skill in 
the art. This invention is not dependent on the host cell selected. Convenience and the 
level of protein expression desired will dictate the optimal host cell. A variety of hosts 
for expression are known in the art and available from the American Type Culture 
Collection (ATCC). 

For example, bacterial hosts suitable herein for expressing the KGF fragment or 
analog include: Campylobacter, Bacillus, Escherichia, Lactobacillus, Pseudomonas, 
Staphylococcus, and Streptococcus. Yeast hosts from the following genera may be 
utilized: Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizo- 
saccharomyces, and Yarrowia. Immortalized mammalian cells that may be used as hosts 
herein include CHO cells, HeLa cells, baby hamster kidney ( n BHK w ) cells, monkey 
kidney cells ("COS"), and human hepatocellular carcinoma cells, e.g., Hep G2. A 
number of insect cell hosts are also suitable for expression of the KGF fragment or 
analog, including: Aedes aegypti , Autographa californica, Bombyx mori, Drosophila 
melanogaster, and Spodoptera frugiperda as described in PCT WO 89/046699; Carbonell 
et aL, J. Virol. 55:153 (1985); Wright Nature J2/:718 (1986); Smith et al., MoL CelL 
Biol. i:2156 (1983); and generally, Fraser, et al. in vitro Cell Dev. BioL 25:225 (1989). 

The expression vector containing the KGF fragment or analog is inserted into the 
host cell. Any transformation techniques that are known in the art can be used for 
inserting expression vectors into the host cells. For example, for transformation of 
bacterial hosts typically includes first treating the bacteria with either CaCl 2 or other 
agents, such as divalent cations and DMSO and allowing the exogenous DNA to be 
introduced to [WHAT??] with the treated bacterial cells. DNA can also be introduced 
into bacterial cells by electroporation or viral infection. Transformation procedures for 
bacterial hosts that can be used herein include those described in Masson et aL FEMS 
Microbiol. Lett. 60:273 (1989); Palva et aL Proc. Natl. Acad. Sci. USA 72:5582 (1982); 
EP Publ. Nos. 036 259 and 063 953; PCT WO 84/04541, Bacillus), Miller et al. Proc. 
Natl. Acad. Sci. 85:856 (1988); Wang et aL J. Bacterid. 172:949 (1990), 
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Campylobacter), Cohen et aL Proc. Natl. Acad. Sci. £2:21 10 (1973); Dower et al. 
Nucleic Acids Res. 16:6127 (1988); Kushner "An improved method for transformation of 
Escherichia coli with ColEl-derived plasmids in Genetic Engineering: Proceedings of the 
International Symposium on Genetic Engineering (eds. H.W. Boyer and S. Nicosia) 
(1978); Mandel et al. J. Mol. Biol. 52:159 (1970); Taketo Biochim. Biophvs. Acta 
949:318 (1988); Escherichia, Chassy et al. FEMS Microbiol. Lett. 44:173 (1987) 
Lactobacillus; Fiedler et aL Anal. Biochem 170:38 (1988), Pseudomonas; Augustin et al. 
FEMS Microbiol. Lett. 66:203 (1990), Staphylococcus, Barany et al. L Bacteriol. 
1M:G98 (1980); Harlander "Transformation of Streptococcus lactis by electroporation," in 
Streptococcal Genetics (ed. J. Ferretti and R. Curtiss III) (1987); Perry et aL Infec. 
Immun. 22:1295 (1981); Powell et aL Appl. Environ. Microbiol. 54:655 (1988); Somkuti 
et al. Proc. 4th Evr. Cong. Biotechnology 1:412 (1987), Streptococcus. 

Transformation methods for yeast hosts are well-known in the art, and typically 
include transformation of either spheroplasts or intact yeast cells treated with alkali 
cations. Yeast hosts can also be transformed by electroporation as described in Methods 
in Enzymology, Volume 194, 1991, "Guide to Yeast Genetics and Molecular Biology." 
For the present invention, the transformation procedure to apply vary with the yeast 

species to be transformed and include those described in Kurtz et aL Mol. Cell. Biol. 

6:142 (1986); Kunze et aL J. Basic Microbiol. 25:141 (1985); Candida; Gleeson et aL L 

Gen. Microbiol. 132:3459 (1986); Roggenkamp et al. Mol. Gen. Genet. 202:302 (1986); 

Hansenula; Das et aL J. Bacteriol. 158:1165 (1984); De Louvencourt et aL J. Bacteriol. 

154:1165 (1983); Van den Berg et aL Bio/Technology g:135 (1990); Kluyveromyces; 

Cregg et al. Mol. Cell. Biol. 5:3376 (1985); Kunze et aL J. Basic Microbiol. 25:141 

(1985); U.S. Patent Nos. 4,837,148 and 4,929,555; Pichia; Hinnen et at. Proc. Natl. 

Acad. Sci. USA 75; 1929 (1978); Ito et aL J. Bacteriol. 153:163 (1983) Saccharomyces; 

Beach and Nurse Nature 30Q:706 (1981); Schizosaccharomyces; Davidow et aL Curr. 

Genet. 10:39 (1985); Gaillardin et aL Curr. Genet. 10:49 (1985); Yarroma. 

Methods for introducing heterologous polynucleotides into mammalian cells are 

known in the art and include, for example, viral infection, dextran-mediated transfection, 

calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, 
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electroporation, encapsulation of the polynucleotide^) in liposomes, and direct 
microinjection of the DNA into nuclei. 

Methods for introducing heterologous DNA into the baculovirus virus to form an 
expression vector and for transformation of an insect host cell are also known in the art as 
5 described in Smith et al. t Mol. Cell. Biol. 2:2156 (1983); and Lucklow and Summers, 
Virology Hi 31 (1989). For example, the KGF fragment DNA can be inserted into the 
polyhedron g^ne by homologous double crossover recombination. Insertion can also be 
made at a restriction enzyme site engineered into the desired baculovirus gene, as 
described in Miller et al. f Bioessavs 4:91 (1989). The DNA sequence of the KGF 

10 fragment when cloned in place of the polyhedron gene in the expression vector, is flanked 
both 5' and 3* by polyhedron-specific sequences and is positioned downstream of the 
polyhedron promoter. 

In the embodiment of the present invention, the newly formed baculovirus 
expression vector can be subsequently packaged into an infectious recombinant 

15 baculovirus. In the baculovirus expression system, homologous recombination occurs at 
low frequency, between about 1% and about 5%. Thus, usually, the majority of the vims 
produced after transfection is still wild-type virus. Recombinant viruses, however, can be 
identified by known methods. For example, the native virus produces polyhedron protein 
at very high levels in the nuclei of infected cells during a late stage of viral infection. 

20 Accumulated polyhedron protein forms occlusion bodies that also contain embedded viral 
particles. These occlusion bodies, up to 15 /xm in size, are highly refractile, giving them 
a bright shiny appearance that is readily visualized under the light microscope. Cells 
infected with recombinant viruses lack occlusion bodies. Recombinant virus and wild-type 
virus can be distinguished by plating, the transfection supernatant or dilutions thereof is 

25 onto a monolayer of insect cells by standard techniques. The plaques can then be 

screened under the light microscope for the presence, indicative of wild-type virus, or 
absence, indicative of recombinant virus, of occlusion bodies as described in "Current 
Protocols in Microbiology" Vol. 2 (Ausubel et al. eds) at 16.8 (Supp. 10, 1990). 

Immunoassays and activity assays that are know in the art can be utilized to 

30 determine if the transformed host cells herein are expressing the desired KGF fragment. 
For example, an immunofluorescence assay can be performed on the transformed host 
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cells without separating the KGF fragments from the cell membrane. In this assay, the 
host cells are first fixed onto a solid support, such as a microscope slide or microtiter 
well. Next, the fixed host cells are exposed to an anti-KGF antibody. Preferably, to 
increase the sensitivity of the assay, the fixed cells are exposed to a second antibody, that 
is labelled and binds to the anti-KGF antibody. For instance, the secondary antibody may 
be labelled with an fluorescent marker. The host cells which express the KGF fragments 
will be fluorescently labelled can be visualized under the microscope. 
Example 1 

A human KGF cDNA was inserted into a baculovirus expression system by 
cloning the cDNA into pAcC13Pst/Not 1 expression vector as shown in FIG. 1. This 
plasmid was derived from pAcC12, as described in Munemitsu et at. MoL Cell. BioL 
10:5977-5982 (1990). 

PCR oligonucleotide primers were designed for the truncated 18 kd KGF based upon the 
available N-terminal amino acid sequence of KGF as described in Finch et al. Science 
2:752-755 (1989). Flanking restriction sites (Pstl and Not!) were incorporated into the 
primers to facilitate subcloning into the pAcC13 expression vector, a derivative of the 
pVL941 transfer vector as described in Luckow et at. Virology 770:31-39 (1989) and 
Quilliam et at. MoL Cell. BioL 70:290102908 (1990). This expression vector was 
constructed by insertion of the KGF-encoding fragment into the PstlJNotl polylinker site 
of pAcC13. The KGF cDNA encoded the mature processed form of KGF, which was 
composed of 163 amino acid residues, with a potential N-glycosylation site at residues 14 
to 16. 

Preparation of condition medium 

Spodoptera Frugiperda, Sf9 insect cells, were infected with baculovirus, 
Autographa Californica, containing cDNA for KGF M63 , After being diluted to [X x 10*] 
cells/ml, the cells were cultured for 48 to 72 hrs in Excell-400 in the absence of serum 
supplement, antibiotics, or fungicides. Culture fluid was collected and centrifuged at 
10,000 x g for 30 minutes to remove floating cells and other cell debris. The condition 
medium was then filtered with an 0.8 /z-filter (Millipore) and approximately 5 liters of this 
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media was concentrated to 200 ml using a filtron cassette system (Omega membrane) 
having a 3-kDa molecular weight cut-off. 

After concentration, the conditioned medium from the transfected Sf9 cells were 
collected by centrifugation at 10,000 x g for 20 minutes and the medium was purified by 
sequential heparin Sepharose ("HS") affinity chromatography. In one embodiment of the 
invention, approximately 1 liter of Sf9 cells conditioned medium was ultra-filtered to 
produce 200 ml of ultrafiltration retentate, using an Omega membrane (Filtron) having a 
cut-off of 3-kDa. 

HSAC and Mono S cation exchange chromatography 

The supernatant was adjusted to pH 7.2. A 30 ml sample was loaded onto a 
heparin Sepharose resin that had been equilibrated in 10 mM Tris-HCl pH 7.3, 150 mM 
NaCl. The resin was washed extensively with equilibration buffer until the absorbance 
had returned to baseline and was then eluted stepwise with increasing NaCl concentrations 
from 0.45 M to 1 M and 2 M NaCl. Aliquots were removed from the fractions for use in 
cell proliferation assays and 1 M NaCl fractions with the highest bioactivity were pooled, 
the pooled fractions were diluted five-fold with 10 mM Tris pH 7.2 (final salt 
concentration 0.2 M NaCl), and the sample was applied with a Super loop onto a Mono S 
column linked to a FPLC system (Pharmacia, Piscataway, NJ). Elution was achieved 
with a linear gradient (10 mM Tris pH 7.3, 0.2 M NaCl to 10 mM Tris pH 7.3, 1 M 
NaCl). After fraction the aliquots were tested for bioactivity, the active fractions were 
pooled. 

The retentate was adjusted to pH 7.3 and loaded onto a HS column containing 
approximately a 30 ml bed volume. The column was allowed to run for approximately 2 
hours at 4°C. The column was washed with 150 ml of equilibration buffer containing 10 
mM Tris-HCl and 0.15 M NaCl at pH 7.3. The retained protein was eluted with 0.45 M 
NaCl and 1 M NaCl. The flow rate of the column during elution was adjusted to 
approximately 90 ml/hr and 3 ml size fractions were collected. 

The bioactive 1 M NaCl fractions of the HS affinity chromatography steps were 
pooled and diluted 5-fold with 10 mM Tris pH 7.3 and directly loaded on a Mono S -HR 
5/5 column (Pharmacia). The retained proteins were eluted using a 0.2 M NaCl to 1 M 
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NaCl gradient. Bioassays of the active fraction was determined using the Balb-Mk cell 
line. SDS PAGE analysis of the bioactive protein fractions isolated by Mono S cation 
exchange chromatography was performed. Fractions 37-39 and fractions 41-42 were 
pooled and 10 ul aliquots were added to SDS and 10 mM DTT. These samples were heat 
denatured and electrophoresed in a 12% polyacrylamide gel which was subsequently silver 
stained. A similar migration pattern was observed whether the samples were run in a 
reduced environment or a non-reduced environment. 25 Kd was the apparent molecular 
weight of the protein contained in fractions 37-39 and 18 kd was the apparent molecular 
weight of the protein contained in fractions 41-42. 

The chromatogram of the HS bioactive fraction shows the profile of Balb/Mk 
bioactivity covering fractions 31 to 47 demonstrating the presence of 2 molecular species, 
one eluting at 0.55 M NaCl, and the other eluting at 0.6 M NaCl. 

KGF activity determination 

The presence of KGF activity in the obtained fractions was assessed by the ability 
of the fractions to promote the growth of BALB/C-Mk cells. In this regard, 10 jil 
aliquots of certain fractions were diluted in 1 ml of 0.2% gelatin in phosphate buffered 
saline ( H PBS H ), and 10 /xl of the diluted fractions were tested for growth stimulatory 
activity of BALB/C-Mk cells seeded in 12-well cluster plates, each containing 22 mm 
wells, at 5 x 10 3 cells per well. All of the KGF activity was found to be retained in the 
column and was eluted with 1 M NaCl. 

The KGF bioactive fractions eluted from the HS column with 1 M NaCl were 
further purified by cation exchange FPLC column chromatography. These fractions were 
pooled, diluted 5-fold with 10 mM Tris, at pH 7.3, and directly loaded on a Mono S HR 
5/5 column from Pharmacia. 

The protein retained in the Mono S HR 5/5 column was eluted with a 0.2 M 
NaCl to 1 M NaCl gradient. The bioactivity of the eluted fractions was assayed as 
described as above using BALB/C-Mk cells. Figure 3 illustrates the activity profile 
covering fractions 31 to 47. 

In the mitogenic assay, 10 4 ABAE and 5 x 10 3 ACE cells per well were seeded 
in 12 well cluster plates at a density of 5 x 10 3 cells per well in 1 ml DMEM 
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supplemented with 10% calf serum and antibiotics as described Bohlen et al. EMBO 
4:1951-1956 (1985), and Gospodarowicz et al. J. Cell Physiol 727:121-136 (1986), and 
Bellosta et al. J. Cell Biol 727:705-713 (1993). After a six hour incubation, a set of 
triplicate wells were trypsiniz&d and the cells were counted to determine the plating 
efficiency. Ten microliter aliquots of the appropriate dilution of each sample were then 
added in triplicate to wells in the dishes on day 0, day 2, and day 4. After the 5th day in 
culture, the plates were trypsinized and cell densities were determined with a Coulter 
counter (Coulter Electronics, Hiaieah, FL). 

Addition of KGF or basic FGF were done on day 0 and every other day at the 
indicated concentration. After 5 days in culture, the cells were trypsinized and final cell 
density was determined using a Coulter counter. 

Electrophores is (NaDodSOJPAGE). 

Polyacrylamide gels were prepared with NaDodS0 4 following the procedure as 
described in Laemmli et al. Nature 227:680-685 (1970). Samples were boiled for 3 min 
in the presence of 10 mM DTT and electrophoresed in a 12% polyacrylamide gel. The 
gels were fixed and silver stained using the reagents and protocol from BioRad and as 
discussed in Merril et al. Science 277:1437-1438. Appropriate molecular weight markers 
were from Biorad. 

Cell proliferation assay 

The mitogenic activity of column fractions and purified samples were determined 
by using Balb/Mk cells as target ceils. Stock cultures were grown and maintained in low 
calcium modified Eagle medium supplemented with 10% FCS, 50 ng/m\ gentamicin, 0.25 
/ig/ml fungizone, and 10 ng/ml aFGF as described in Gospodarowicz et aL J. Cell 
Physiol 742:325-333. Mitogenic assays cells were seeded in 12 wells cluster plates at a 
density of 5 x 10 3 to 1 x 10 4 cells per well in 1 ml low calcium MEM supplemented with 
10% FCS as set forth in Gospodarowicz et al Sample additions and final cell density 
determinations were made after 5 days in culture was performed as described in 
Gospodarowicz et al 
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The mitogenic activity of the final purified material was tested on adult bovine 
aortic endothelial cells ("ABAE") and adrenal cortex-derived capillary endothelial cells 
("ACE cells") as described in Gospodarowicz et al. Proc. Natl Acad. Sci. 73:4120-4124 
(1976). Stock cultures w#e maintained in the presence of DMEM supplemented with 
10% CS, 50 fig/ml gentamicin, and 0.25 /xg/ml. Fungizone was passaged weekly on 
gelatinized tissue culture dishes at a split ratio of 1:10. 

Using standard methodology well known in the art, an unambiguous amino acid 
sequence was established for position 1 to 20 from the NH 2 terminus of KGF^,.^ as 
follows: 

S, YD Mj EGG DIRV RRLFXRTQ 

The present invention also includes DNA segments encoding ICGF^.^ and other 
shorter form of KGF lacking the unconsented NH 2 terminal characteristic of KGF I63 
versus other members of the FGF family. 

Protein microsequencing 

Two nominal 100 picomole samples of the KGF 163 and KGF^.jj were analyzed 
by Edman degradation and AA analysis after centrifugal adsorption to polyvinylidene 
difluoride (PVDF, Applied Biosystems Biospin). The samples were loaded onto an 
Applied Biosystems 477A gas-phase protein sequenator. Twenty rounds of Edman 
degradation were carried out using standard software and chemicals supplied by Applied 
Biosystems, and identifications of PTH amino acids were made with an automated on-line 
HPLC column (model 120A, Applied Biosystems). 
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What is Claimed : 

1. A keratinocyte growth factor fragment comprising a portion of an amino acid 
sequence of mature, full-length keratinocyte growth factor, wherein the portion possesses 
at least a 2-fold increase in mitogenic a-^vlty as comparrxi to a mature- r recombinant, full- 
length keratinocyte growth factor, and lacks a sequence comprising the first 23 N-terminus 
amino acid residues of the mature, full-length keratinocyte growth factor. 

2. The keratinocyte growth factor fragment as claimed in claim 1, wherein the 
fragment possesses a 7-fold increase in mitogenic activity as compared to the mature, 
recombinant, full-length keratinocyte growth factor. 

3. The keratinocyte growth factor fragment as claimed in claim 1, wherein the 
fragment possesses a 10-fold increase in mitogenic activity as compared to the mature, 
recombinant, full-length keratinocyte growth factor. 

4. The keratinocyte growth factor fragment as claimed in claim 1, wherein the 
fragment exhibits decreased cytotoxicity as compared to the mature, recombinant, full- 
length keratinocyte growth factor. 

5. A conjugate comprising: 

(a) a keratinocyte growth factor fragment that comprises a portion of an amino 
acid sequence of mature, full-length keratinocyte growth factor, wherein the 
portion possesses at least a 2-fold increase in mitogenic activity as compared to 
the mature, recombinant, full-length keratinocyte growth factor and lacks a 
sequence comprising the first 23 N-terminus amino acid residues of the mature, 
full-length keratinocyte growth factor, and 

(b) a toxin molecule. 

6. The conjugate of claim 5, wherein the toxin molecule is selected from the 
group consisting of ricin A, diphtheria toxin, and saporin. 

7. The conjugate of claim 5, wherein the fragment possesses a 7-fold increase in 
mitogenic activity as compared to the mature, recombinant, full-length keratinocyte 
growth factor. 

8. The conjugate of claim 5, wherein the fragment possesses a 10-fold increase in 
mitogenic activity as compared to the mature, recombinant, full-length keratinocyte 
growth factor. 
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9. A therapeutic composition comprising: 

(a) a keratinocyte growth factor fragment that comprises a portion of an amino 
acid sequence of mature, full-length keratinocyte growth factor, wherein the 
portion possesses at least a 2-fold increase in autogenic activity as compared to 
the mature, recombinant, full-length keratinocyte growth factor but lacks a 
sequence comprising the first 23 N-terminus amino acid residues of the mature, 
full-length keratinocyte growth factor, and 

(b) a pharmaceutically acceptable carrier. 

10. A DNA molecule comprising a nucleotide sequence that encodes the 
keratinocyte growth factor fragment of claim 1. 

11. A DNA molecule comprising a nucleotide sequence that encodes the 
keratinocyte growth factor fragment of claim 2. 

12. A DNA molecule comprising a nucleotide sequence that encodes the 
keratinocyte growth factor fragment of claim 3. 

13. An expression vector comprising the DNA molecule of claim 10 and a 
regulatory sequence for expression of the DNA molecule. 

14. The expression vector as claimed in claim 13, wherein the vector is a 
baculovirus. 

15. A host cell transformed with the expression vector of claim 13. 

16. The host cell as claimed in claim 15, wherein the cell is selected from the 
group consisting of a bacterial cell, a yeast cell, a mammalian cell and an insect cell. 

17. A method of producing a keratinocyte growth factor fragment comprising the 
steps of culturing the host cell of claim 15, and isolating the keratinocyte growth factor 
fragment from the culture. 

18. A method for wound healing comprising applying the therapeutic composition 
of claim 9 to an area of a wound to be treated and allowing the wound to heal. 

19. A method of treatment of a hyperproliferative disease of the epidermis 
comprising applying the conjugate of claim 5 to an area to be treated. 

20. The method of treatment as claimed in claim 20, wherein the disease is one 
selected from the group consisting of psoriasis and basal cell carcinoma. 
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Long Form Start CHO Site 

1 5 10 -IS 

i CYS-ASN-ASP-MET-THR-PRO-^U-GLN-MET-ALA-THR-ASN-VAL-ASN-CYS- 
I — > 

Short: Form Start 



16 



20 



25 



30 



SER-SER-PRO-GLU-ARG-BIS-TBR-ARGfSER-TYR-ASP-TYR-^T-GLU-CLY- 

31 35 40 45/ 

GLY— ASP — I LE— ARG— VAL— ARGt. ARG-LEU**P HE-*CYS— ARG— TRR-G LN -TRP— TYR- 

46 50 5S 60 

LEU— ARG-I1^— ASP— LYS-ARG-GLY— LYS-VAL-LYS-GLY-THR—GLN-GLU-MET— 

61 65 70 75 

LYS-ASN-ASN-TYK-ASN-ILE-MET-GLU-ILE-ARG-TRR-VAL-ALA-VAL-GLY- 

76 80 85 90 

ILE-VAL-ALA-ILE-LYS-GLY«-VAL-GLU-SER-GLU-PH£-TYR-LEU-ALA-MET- 

91 95 100 105 

ASN— LYS— GLUH3LY— LYS— LEU-TYR-ALA-LYS-LYS— GLU— CYS— ASN-GLU— ASP- 

106 110 US 120 

CYS-ASM-PHE-LYS-<3LU-LEU-ILE-L£U-GLU-ASN-HIS-TYR-ASN-TaR-TYR- 

121 125 130 135 

ALA— SER— 7\I^-LYS— TRP— THR-BIS-ASN-GLY-GLY— GLU-MET-PRE-VAL— ALA- 

136 140 145 150' 

LEU— ASN— GLN-LYS— GLY-ILE-PRO— VAL— ARG— GLY— LYS-LYS— THR— LYS— LYS~ 

151 155 160 

GLU-GLN-LYS-THR-AlJi-HIS-PHE-IXU-PRO-^T-ALA-ILE-TnR- 
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